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TITLE OF THE INVENTION 
NUCLEIC ACID INTEGRATION IN EUKARYOTES 



CROSS-REFERENCE TO RELATED APPLICATION 
[0001] This application is a continuation of PCT International Patent Application No. 
PCT/NL/01/00936, filed on December 21, 2001, designating the United States of America, and 
published, in English, as PCT International Publication No. WO 02/052026 on July 4, 2002, the 
contents of the entirety of which is incorporated by this reference. 

TECHNICAL FIELD 

[0002] The invention relates generally to the field of molecular biology and cell 
biology. It particularly relates to methods to direct integration towards homologous 
recombination and uses thereof. 

BACKGROUND 

[0003] Several methods are known to transfer nucleic acids to, in particular, eukaryotic 
cells. In some methods, the nucleic acid of interest is transferred to the cytoplasm of the cell; in 
some, the nucleic acid of interest is integrated into the genome of the host. Many different 
vehicles for transfer of the nucleic acid are known. For different kinds of cells, different systems 
can be used, although many systems are more widely applicable than just a certain kind of cells. 
In plants, e.g., a system based on Agrobacterium tumefaciens is often applied. This system is 
one of the systems that are used in a method according to the invention. 

[0004] State of the Art: The soil bacterium Agrobacterium tumefaciens is able to 
transfer part of its tumor-inducing (Ti) plasmid, the transferred (T-) DNA, to plant cells. This 
results in crown gall tumor formation on plants due to expression of owogenes, which are 
present on the T-DNA. Virulence (vir) genes, located elsewhere on the Ti-plasmid, mediate T- 
DNA transfer to the plant cell. Some Vir proteins accompany the T-DNA during its transfer to 
the plant cell to protect the T-DNA and to mediate its transfer to the plant nucleus. Once in the 
plant nucleus, the T-DNA is integrated at a random position into the plant genome (reviewed by 
Hooykaas and Beijersbergen, 1994, and Hansen and Chilton, 1999). Removal of the one-genes 
from the T-DNA does not inactivate T-DNA transfer. T-DNA, disarmed in this way, is now the 
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preferred vector for the genetic modification of plants. 

[0005] Although much is known about the transformation process, not much is known 
about the process by which the T-DNA is integrated into the plant genome. It is likely that plant 
enzymes mediate this step of the transformation process (Bundock et al., 1995). The integration 
pattern of T-DNA in transformed plants has been extensively studied (Matsumoto et al., 1990; 
Gheysen et al., 1991; Meyerhofer et al., 1991). The results indicated that T-DNA integrates via 
illegitimate recombination (IR) (also called nonhomologous recombination; both terms may be 
used interchangeably herein), a process which can join two DNA molecules that share little or no 
homology (here the T-DNA and plant target DNA). Even T-DNA molecules in which a large 
segment of homologous plant DNA was present integrated mainly by IR and only with very low 
frequency (1:10 4 -10 5 ) by homologous recombination (HR) (Offringa et al., 1990). 

[0006] Recently, it was shown that Agrobacterium, is not only able to transfer its 
T-DNA to plant cells, but also to other eukaryotes, including the yeast S. cerevisiae (Bundock et 
al., 1995) and a wide variety of filamentous fungi (de Groot et al., 1998). In S. cerevisiae, T- 
DNA carrying homology with the yeast genome integrates via HR (Bundock et al., 1995). 
However, T-DNA lacking any homology with the S. cerevisiae genome becomes integrated at 
random positions in the genome by the same IR process as is used in plants (Bundock and 
Hooykaas, 1996). Apparently, eukaryotic cells have at least two separate pathways (one via 
homologous recombination and one via nonhomologous recombination) through which nucleic 
acids (in particular, of course, DNA) can be integrated into the host genome. The site of 
integration into a host cell genome is important with respect to the likelihood of transcription 
and/or expression of the integrated nucleic acid. The present invention provides methods and 
means to direct nucleic acid integration to a predetermined site through steering integration 
towards the homologous recombination pathway. The present invention arrives at such steering 
either by enhancing the HR pathway or by inhibiting (meaning reducing) the IR pathway. 

[0007] Host factors involved in the integration of nucleic acid by IR have not so far 
been identified. The present invention discloses such factors which enables the design of 
methods for their (temporary) inhibition, so that integration of nucleic acid by IR is prevented or 
more preferably completely inhibited, shifting the integration process towards HR and 
facilitating the isolation of a host cell with nucleic acid integrated by HR at a predetermined site. 
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This is extremely important, since there is no method available yet for easy and precise genetic 
modification of a host cell using HR (gene targeting). Of course, the actual site of integration is 
then determined by homology of the nucleic acid of interest with the site. 

BRIEF SUMMARY OF THE INVENTION 
[0008] In a first embodiment, the invention provides a method of directing integration 
of a nucleic acid of interest to a predetermined site, whereby the nucleic acid has homology at or 
around the predetermined site, in a eukaryote with a preference for nonhomologous 
recombination comprising steering an integration pathway towards homologous recombination. 
Preferably, such a method comprises at least the steps of introducing the nucleic acid of interest 
to a cell of the eukaryote, for example, by the process of transformation or electroporation, and 
integration of the nucleic acid in the genetic material of the cell. Integration is a complex 
process wherein a nucleic acid sequence becomes part of the genetic material of a host cell. One 
step in the process of nucleic acid integration is recombination; via recombination, nucleic acid 
sequences are exchanged or inserted and the introduced nucleic acid becomes part of the genetic 
material of a host cell. In principle, two different ways of recombination are possible: 
homologous and illegitimate or nonhomologous recombination. Most (higher) eukaryotes do 
not, or at least not significantly, practice homologous recombination, although the essential 
proteins to accomplish such a process are available. One reason for this phenomenon is that 
frequent use of homologous recombination in (higher) eukaryotes could lead to undesirable 
chromosomal rearrangements due to the presence of repetitive nucleic acid sequences. To 
accomplish homologous recombination via a method according to the invention, it is important 
to provide a nucleic acid which has homology with a predetermined site. It is clear to a person 
skilled in the art that the percentage of homology and the length of homologous regions play an 
important role in the process of homologous recombination. The percentage of homology is 
preferably close to 100%. A person skilled in the art is aware of the fact that lower percentages 
of homology are also used in the field of homologous recombination but dependent on, for 
example, the regions of homology and their overall distribution, which can lead to a lower 
efficiency of homologous recombination but are still useful and, therefore, included in the 
present invention. Furthermore, the length of a nearly homologous region is approximately 3 kb, 
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which is sufficient to direct homologous recombination. At least one homologous region is 
necessary for recombination but, more preferably, two homologous regions flanking the nucleic 
acid of interest are used for targeted integration. The researcher skilled in the art knows how to 
select the proper percentage of homology, the length of homology and the amount of 
homologous regions. By providing such a homology, a nucleic acid is integrated at every 
desired position within the genetic material of a host cell. It is clear to a person skilled in the art 
that the invention as disclosed herein is used to direct any nucleic acid (preferably DNA) to any 
predetermined site as long as the length of homology and percentage of homology are high 
enough to provide/enable homologous recombination. A predetermined site is herein defined as 
a site within the genetic material contained by a host cell to which a nucleic acid with homology 
to this same site is integrated with a method according to the invention. It was not until the 
present invention that a nucleic acid is integrated at every desired position and, therefore, a 
method according to the invention is applied, for example, to affect the gene function in various 
ways, not only for complete inactivation but also to mediate changes in the expression level or in 
the regulation of expression, changes in protein activity or the subcellular targeting of an 
encoded protein. Complete inactivation, which usually cannot be accomplished by existing 
methods such as antisense technology or RNAi technology (Zrenner et al., 1993), is useful, for 
instance, for the inactivation of genes controlling undesired side branches of metabolic pathways, 
for instance, to increase the quality of bulk products such as starch, or to increase the production 
of specific secondary metabolites or to inhibit formation of unwanted metabolites. A method 
according to the invention is also used to inactivate genes controlling senescence in fruits and 
flowers or that determine flower pigments. Replacement of existing regulatory sequences by 
alternative regulatory sequences is used to alter expression of in situ modified genes to meet 
requirements (e.g., expression in response to particular physical conditions such as light, drought 
or pathogen infection, or in response to chemical inducers, or depending on the developmental 
state (e.g., in a storage organ, or in fruits or seeds) or on tissue or cell types). Also, a method 
according to the invention is used to effectuate predictable expression of transgenes encoding 
novel products, for example, by replacing existing coding sequences of genes giving a desired 
expression profile by those for a desired novel product. For example, to produce proteins of 
medicinal or industrial value in the seeds of plants, the coding sequence of a strongly expressed 
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storage protein may be replaced by that of the desired protein. As another example, existing 
coding sequences are modified so that the encoded protein has optimized characteristics, for 
instance, to make a plant herbicide tolerant, to produce storage proteins with enhanced 
nutritional value, or to target a protein of interest to an organelle or to secrete it to the 
extracellular space. As yet another example, eukaryotic cells (including yeast, fungus, plant, 
mammalian cells or nonhuman animal cells) are provided with a gene encoding a protein of 
interest integrated into the genome at a site which ensures high expression levels. As another 
example, the nucleic acid of interest can be part of a gene delivery vehicle to deliver a gene of 
interest to a eukaryotic cell in vitro or in vivo. In this way, a defective p53 can be replaced by an 
intact p53. In this way, a tumoricidal gene is delivered to a predetermined site present only in, 
e.g., proliferating cells, or present only in tumor cells, for example, to the site from which a 
tumor antigen is expressed. Gene delivery vehicles are well known in the art and include 
adenoviral vehicles, retroviral vehicles, nonviral vehicles such as liposomes, etc. As another 
example, the invention is used to produce transgenic organisms. Knockout transgenics are 
already produced by homologous recombination methods. The present invention improves the 
efficiency of such methods. Also, transgenics with desired properties are made. It is clear to a 
person skilled in the art that transgenics can, for example, be made by the use of Agrobacterium 
as a gene delivery vehicle for plant (Vergunst et al., 1998), yeast (Bundock et al., 1995), fungus 
(de Groot et al., 1998) or animal (Kunik et al., 2001) or by direct DNA delivery methods 
exemplified by, but not restricted to, electroporation for yeast (Gietz & Woods, 2001), plant 
(D'Halluin et al., 1992; Lin et al., 1997), fungus (Ozeki et al., 1994) and animal (Templeton et 
al., 1997), LiCl treatment for yeast (Schiestl et al., 1993), microinjection for plant (Schnorf et al., 
1991) and animal (Capecchi, 1980) and "DNA whiskers" for plant (Kaeppler et al., 1990; 
Dunwell, 1999) or particle bombardment for plants and animals (Klein et al., 1992). It is, 
furthermore, clear that transgenic plants can be obtained via selective regeneration of 
transformed plant cells into a complete fertile plant (Vergunst et al., 1998) or via nonregenerative 
approaches by transforming germ line cells exemplified by, but not restricted to, dipping 
Arabidopsis flowers into an Agrobacterium suspension (Bechtold et al., 1993). It is also clear 
that transgenic animals can be obtained by transforming embryonic stem cells with one of the 
DNA delivery methods mentioned above (Hooper, 1992). 
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[0009] In another embodiment, the invention provides a method of directing nucleic 
acid integration to a predetermined site, whereby the nucleic acid has homology at or around the 
predetermined site, in a eukaryote with a preference for nonhomologous recombination 
comprising steering an integration pathway towards homologous recombination by providing a 
mutant of a component involved in nonhomologous recombination. Methods to identify 
components involved in nonhomologous recombination are outlined in the present description 
wherein S. cerevisiae was used as a model system. To this end, several yeast derivatives 
defective for genes known to be involved in various recombination processes were constructed 
and the effect of the mutations on T-DNA integration by either HR or IR was tested. The results 
as disclosed herein show that the proteins encoded by YKU70, RAD50, MRE11, XRS2, LIG4 and 
SIR4 play an essential role in DNA integration by IR but not by HR. WO 00/12716 describes a 
maize Ku70 orthologue and suggests that "Control of homologous recombination or 
nonhomologous end joining by modulating Ku provides the means to modulate the efficiency 
[sic, with] which heterologous nucleic acids are incorporated into the genomes of a target plant 
cell." WO 00/68404 describes a maize Rad50 orthologue and suggests an analogous control for 
RadSO. Both patent applications, however, do not disclose, in contrast to the present patent 
application, that by preventing or more preferably completely inhibiting nonhomologous 
recombination, for example, by providing a mutant of a component involved in nonhomologous 
recombination or by inhibiting such a component, the integration pathway is steered towards 
homologous recombination. It is clear to a person skilled in the art that different mutants of a 
component involved in nonhomologous recombination exist. Examples are deletion mutants, 
knockout (for example, via insertion) mutants or point mutants. Irrespective of the kind of 
mutant, it is important that a component involved in nonhomologous recombination is no longer 
capable or at least significantly less capable to perform its function in the process of 
nonhomologous recombination. As disclosed herein, disruption of YKU70, RADSO, MRE11, 
XRS2, LIG4 and SIR4 did not affect the frequency of DNA integration by HR, showing that these 
genes are not involved in DNA integration by HR, but only in DNA integration by IR. 
Moreover, in the wild-type yeast strain, 85% of the integration events occurred by HR (37% by 
replacement and 63% by insertion) and 1 5% by IR. In contrast, integration occurred only by HR 
in yeast strains lacking ku 70 or Ug4. In rad50 and xrs2 mutant strains, the T-DNA preferentially 
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integrated by HR (92%) and 93% of these T-DNAs integrated by replacement and only 7% by 
insertion. Thus, the absence of a functional rad50 or xrs2 gene leads to a significantly increased 
frequency of replacement reactions. 

[0010] In another embodiment, the invention provides a method of directing integration 
of a nucleic acid of interest to a subtelomeric and/or telomeric region in a eukaryote with a 
preference for nonhomologous recombination by providing a mutant of a component involved in 
nonhomologous recombination. A telomeric region is typically defined as a region containing 
repetitive sequences which is located at the end of a chromosome. A subtelomeric region is 
typically defined as a region flanking the telomeric region. As an example, it is disclosed herein 
that in yeast strains carrying disruptions of RAD50, MRE11 or XRS2, the distribution of 
integrated DNA copies is altered when compared to wild-type. DNA becomes preferentially 
integrated in telomeres or subtelomeric regions in the rad50, mrell and xrs2 mutants. A great 
advantage of integration of DNA copies in telomeres or subtelomeric regions instead of 
integration elsewhere in the genomic material is that there is no danger for host genes being 
mutated or inactivated by a DNA insertion. When in plants deficient for RAD50, MRE11 or 
XRS2, DNA copies also integrate into telomeres or subtelomeric regions. Such plants are used 
for subtelomeric targeting of T-DNA in transformation experiments to prevent additional 
insertion mutations from random T-DNA integration into the plant genome. 

[0011] In yet another embodiment, the invention provides a method of directing nucleic 
acid integration to a predetermined site, whereby the nucleic acid has homology at or around the 
predetermined site, in a eukaryote with a preference for nonhomologous recombination 
comprising steering an integration pathway towards homologous recombination by partially or 
more preferably completely inhibiting a component involved in nonhomologous recombination. 
Partial or complete inhibition of a component involved in nonhomologous recombination is 
obtained by different methods, for example, by an antibody directed against such a component or 
a chemical inhibitor or a protein inhibitor or peptide inhibitor or an antisense molecule or an 
RNAi molecule. Irrespective of the kind of (partial or more preferably complete) inhibition, it is 
important that a component involved in nonhomologous recombination is no longer capable or at 
least significantly less capable to perform its function in the process of nonhomologous 
recombination. In yet another embodiment, the invention provides a method of directing 
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integration of a nucleic acid of interest to a subtelomeric and/or telomeric region in a eukaryote 
with a preference for nonhomologous recombination by partially or more preferably completely 
inhibiting a component involved in nonhomologous recombination. Preferably, the component 
involved in nonhomologous recombination is radSO, mrell or xrs2. 

[0012] In a preferred embodiment, the invention provides a method of directing nucleic 
acid integration to a predetermined site or to a subtelomeric and/or telomeric region by providing 
a mutant of a component involved in nonhomologous recombination or by partially or more 
preferably completely inhibiting a component involved in nonhomologous recombination 
wherein the component comprises ku70, radSO, mrell, xrs2, Hg4, sir 4 or others such as ku80 
(Tacciole et al., 1994; Milne et al., 1996), lift (Teo and Jackson, 2000; XRCC4 in human, see 
FIG. 6; Junop et al., 2000) and nejl, (Kegel et al., 2001; Valencia et al., 2001). Components 
involved in nonhomologous recombination are identified as outlined in the present description. 
The nomenclature for genes as used above is specific for yeast. Because the nomenclature of 
genes differs between organisms, a functional equivalent or a functional homologue (for 
example, NBS1, a human xrs2 equivalent (Paull and Gellert, 1999) and see, for example, FIGS. 2 
to 5) and/or a functional fragment thereof, all defined herein as being capable of performing (in 
function, not in amount) at least one function of the yeast genes ku70, radSO, mrell, xrs2, lig4„ 
sir 4, ku80, lifl or nejl, are also included in the present invention. A mutant of a component 
directly associating with a component involved in nonhomologous recombination or partial or 
complete inhibition of a component directly associating with a component involved in 
nonhomologous recombination is also part of this invention. Such a component directly 
associating with a component involved in nonhomologous recombination is, for example, 
identified in a yeast two-hybrid screening. An example of a component directly associating with 
a component involved in nonhomologous recombination is KU80, which forms a complex with 
KU70. In a more preferred embodiment, the invention provides a method of directing nucleic 
acid integration in yeast, fungus, plant or nonhuman animal cells. 

[0013] In another embodiment, the invention provides a method of directing nucleic 
acid integration to a predetermined site, whereby the nucleic acid has homology at or around the 
predetermined site, in a eukaryote with a preference for nonhomologous recombination 
comprising steering an integration pathway towards homologous recombination by transiently 



9 



(partially or more preferably completely) inhibiting integration via nonhomologous 
recombination. In yet another embodiment, the invention provides a method of directing 
integration of a nucleic acid of interest to a subtelomeric and/or telomeric region in a eukaryote 
with a preference for nonhomologous recombination by transiently (partially or more preferably 
completely) inhibiting integration via nonhomologous recombination. In a more preferred 
embodiment, such a method is used for yeast, plant, fungus or nonhuman animal and the 
transient (partial or more preferably complete) inhibition is provided by a preferably stably 
inserted and expressed chimeric transgene that encodes a peptide inhibitory to one, some or all 
nonhomologous recombination (NHR) enzymes fused to a nuclear localization signal (Hanover, 
1992; Raikhel, 1992) and the steroid-binding domain of a steroid receptor (Picard et al., 1988). 
The chimeric transgene is constructed in such a way, using either heterologous or 
nonheterologous promoter sequences and other expression signals, that it provides stable 
expression in the target cells or tissue for transformation. In the absence of the steroid hormone, 
the steroid-binding domain binds to chaperone proteins, and thereby the fusion protein is retained 
in the cytoplasm. Upon treatment with the steroid hormone, the chaperones are released from 
the steroid-binding domain and the inhibitory peptide will enter the nucleus where it will interact 
with and inhibit the action of NHR enzymes. An example of an inhibitory peptide is a KU80 
fragment that imparts radiosensitivity to Chinese hamster ovary cells (Marangoni et al., 2000). 

[0014] In a more preferred embodiment, such a method is used for yeast, plant, fungus 
or a nonhuman animal and the transient (partial or more preferable complete) inhibition is 
provided by an Agrobacterium Vir-fusion protein capable of (partially or more preferably 
completely) inhibiting a component involved in nonhomologous recombination or capable of 
(partially or more preferably completely) inhibiting a functional equivalent or homologue thereof 
or capable of (partially or more preferably completely) inhibiting a component directly 
associating with a component involved in nonhomologous recombination. In an even more 
preferred embodiment, such an Agrobacterium Vir-fusion protein comprises VirF or VirE2. It 
was shown that the Agrobacterium VirF and VirE2 proteins are directly transferred from 
Agrobacterium to plant cells during plant transformation (Vergunst et al., 2000). To, for 
example, accomplish T-DNA integration by HR in plants, VirF-fusion proteins containing, for 
example, a peptide inhibitor of IR in plant cells are introduced concomitantly with the targeting 
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T-DNA. It has been reported that the C-terminal part (approximately 40 amino acids) of VirF or 
VirE2 is sufficient to accomplish transfer of T-DNA. A functional fragment and/or a functional 
equivalent of VirF or VirE is, therefore, also included in the present invention. Preferably, the 
nucleic acid of interest is delivered to a cell of the eukaryote by Agrobacterium. 

[0015] In an even more preferred embodiment, a component involved in 
nonhomologous recombination comprises ku70, radSO, mrell, xrs2, Hg4, sir 4, ku80, lifl or nejl 
or functional equivalents or homologue thereof or associating components. The nomenclature 
for genes as used above is specific for yeast. Because the nomenclature of genes differs between 
organisms, a functional equivalent or a functional homologue (see, for example, FIGS. 2 to 5) 
and/or a functional fragment thereof, all defined herein as being capable of performing (in 
function, not in amount) at least one function of the yeast genes ku70, rad50, mrell, xrs2, Hg4, 
sir '4, ku80, lifl or nejl, are also included in the present invention.' By transiently (partially or 
more preferably completely) inhibiting a component involved in nonhomologous recombination, 
a nucleic acid is integrated at any desired position without permanently modifying a component 
involved in nonhomologous recombination and preventing unwanted side effects caused by the 
permanent presence of such a modified component involved in nonhomologous recombination. 

[0016] Methods according to the present invention, as extensively but not limiting 
discussed above, are used in a wide variety of applications. One embodiment of the present 
invention is the replacement of an active gene by an inactive gene according to a method of the 
invention. Complete inactivation, which usually cannot be accomplished by existing methods 
such as antisense technology or RNAi technology, is useful, for instance, for the inactivation of 
genes controlling undesired side branches of metabolic pathways, for instance, to increase the 
quality of bulk products such as starch, to increase the production of specific secondary 
metabolites or to inhibit formation of unwanted metabolites, and to inactivate genes controlling 
senescence in fruits and flowers or to determine flower pigments. Another embodiment of the 
present invention is the replacement of an inactive gene by an active gene. One example is the 
replacement of a defective p53 by an intact p53. Many tumors acquire a mutation in p53 during 
their development which renders it inactive and often correlates with a poor response to cancer 
therapy. By replacing the defect p53 by an intact p53, for example, via gene therapy, 
conventional anticancer therapy has a better chance of succeeding. In yet another embodiment 
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of the invention, a therapeutic proteinaceous substance is integrated via a method of the 
invention. In this way, a tumoricidal gene is delivered to a predetermined site present only in 
e.g. proliferating cells, or present only in tumor cells, e.g., to the site from which a tumor antigen 
is expressed. In yet another embodiment, the invention provides a method to introduce a 
substance conferring resistance for an antibiotic substance to a cell. Also, a method according to 
the invention is used to confer a desired property to a eukaryotic cell. In a preferred 
embodiment, a gene delivery vehicle is used to deliver a desired nucleic acid to a predetermined 
site. Gene delivery vehicles are well known in the art and include adenoviral vehicles, retroviral 
vehicles, nonviral vehicles such as liposomes, etc. In this way, for example, a tumoricidal gene 
can be delivered to a predetermined site present only in, e.g., proliferating cells, or present only 
in tumor cells, e.g. to the site from which a tumor antigen is expressed. 

[0017] Furthermore, a method according to the invention is used to improve gene- 
targeting efficiency. Such a method is used to improve, for example, the gene-targeting 
efficiency in plants. In plants, transgenes integrate randomly into the genome by IR (Mayerhof 
et al., 1991; Gheysen et al., 1991). The efficiency of integration by HR is very low, even when 
large stretches of homology between the transgene and the genomic target site are present 
(Offringa et al., 1990). Therefore, the efficiency of gene targeting using HR is very low in 
plants. The results that are disclosed herein show how to improve the gene-targeting efficiency 
in plants. From the fact that T-DNA integration by IR is strongly reduced in KU70-, RAD50-, 
MRE11-, XRS2-, LIG4- and £tf?4-deficient yeast strains and T-DNA integration by HR is not 
affected in such strains, T-DNA integration by HR is more easily obtained in plants deficient for 
either of these genes. Recently, we have cloned a KU70 homologue of Arabidopsis thaliana (see 
FIG. 2, Bundock 2000, unpublished data). RAD50, MRE11 and LIG4 homologues have already 
been found in A. thaliana (GenBank accession numbers AF168748, AJ243822 and AF233527, 
respectively; see also FIGS. 3, 4 and 5 (Hartung and Puchta, 1999)). Currently, screenings are 
being performed to find plants carrying a T-DNA inserted in AtMREll, AtKU70 or AtLIG4. 
These knockout plants are used to test whether T-DNA integration by IR is reduced and 
integration by HR is essentially unaffected, thereby facilitating the detection of T-DNA 
integration by HR. 

[0018] Furthermore, the invention provides a method of directing integration of a 
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nucleic acid of interest to a predetermined site, whereby the nucleic acid has homology at or 
around the predetermined site, in a eukaryote with a preference for nonhomologous 
recombination, comprising steering an integration pathway towards homologous recombination, 
wherein the nucleic acid sequence of interest is essentially replacing a sequence within the 
eukaryote. As disclosed herein within the experimental part, in the wild-type yeast strain, 85% 
of the integration events occurred by HR (37% by replacement and 63% by insertion) and 15% 
by IR. In contrast, integration occurred only by HR in yeast strains lacking ku 70 or Ug4. In 
rad50 and xrs2 mutant strains, the T-DNA preferentially integrated by HR (92%) and 93% of 
these T-DNAs integrated by replacement and only 7% by insertion. Thus, the absence of a 
functional radSO or xrs2 gene leads to a significantly increased frequency of the desired 
replacement reactions. 

[0019] The invention will be explained in more detail in the following description, 
which is not limiting to the invention. 

BRIEF DESCRIPTION OF DRAWINGS 

[0020] FIG. 1 : Junction sequences of T-DNA and S. cerevisiae genomic DNA. & 
cerevisiae YPH250 (WT), radSO, mrell and xrs2 strains were cocultivated with 
LBA1119(pSDM8000). G4 18 -resistant colonies were obtained. Chromosomal DNA was 
isolated and subjected to Vectorette PCR to determine the sequence of genomic DNA flanking 
the T-DNA. The position of T-DNA integration was determined by basic BLAST search of the 
yeast genome at http:/www.genome-stanford.edu/SGD. The Watson strand of genomic DNA 
that is fused to the LB or RB is shown in italics. Bold sequences represent sequence homology 
between the LB and target site. The filler DNA sequence is underlined and depicted in italics. 
The numbers above the LB sequences represent the number of bp deleted from the LB. 
Tel. = telomeric, Subtel. = subtelomeric and Int. = intergenic. 

[0021] FIG. 2: Alignment of KU70 homologues. Sc = Saccharomyces cerevisiae, 
Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted as black boxes, 
homology is depicted as grey boxes and dashes are used to optimize alignment. 

[0022] FIG. 3: Alignment of LIG4 homologues. Sc = Saccharomyces cerevisiae, 
Hs=Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted as black boxes, 
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homology is depicted as grey boxes and dashes are used to optimize alignment. 

[0023] FIG. 4: Alignment of MRE 11 homologies. Sc = Saccharomyces cerevisiae, Hs 
= Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted as black boxes, 
homology is depicted as grey boxes and dashes are used to optimize alignment. 

[0024] FIG. 5: Alignment of RAD50 homologues. Sc = Saccharomyces cerevisiae, 
Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted as black boxes, 
homology is depicted as grey boxes and dashes are used to optimize alignment. 

[0025] FIG. 6: Alignment of XRCC4 homologues. Sc = Saccharomyces cerevisiae, 
Hs = Homo sapiens and At = Arabidopsis thaliana. 

DETAILED DESCRIPTION OF THE INVENTION 
EXPERIMENTAL PART 
Yeast strains. 

[0026] The yeast strains that were used are listed in Table 1. Yeast mutants isogenic to 
the haploid YPH250 strain were constructed using the one-step disruption method (Rothstein, 
1991). A 1987 bp fragment from the YKU70 locus was amplified by PCR using the primers 
hdflpl 5 '-GGGATTGCTTTAAGGTAG-3 ' and hdflp2 5'-CAAATACCCTACCCTACC-3\ The 
PCR product was cloned into pT7Blue (Novagen) to obtain pT7BlueITO70. An 1177 bp 
EcoRV/Hindlll fragment from the YKU70 ORF was replaced by a 2033 bp HindllVSmal 
Z^C/2-containing fragment from pJJ283 (Jones and Prakash, 1990), to form 
pT7BhjLeYKU70::LEU2. In order to obtain YKU70 disruptants, Leu + colonies were selected after 
transformation of YPH250 with a 2884 bp NdeVSmal fragment from pT7BlueYKU70::LEU2. 
The Expand™ High Fidelity System (Boehringer Mannheim) was used according to the supplied 
protocol to amplify a 3285 bp fragment from the LIG4 locus with primers dnl4pl 
5'-CGTAAGATTCGCCGAGTATAG-3' and dnl4p2 5 -CGTTTCAAATGGGACCAC AGC-3 
The PCR product was cloned into pGEMT (Promega), resulting in pGEMTZJG4. A 1326 bp 
BamWJXhol fragment from pJJ2 15 (Jones and Prakash, 1990) containing the HIS3 gene was 
inserted into the BamUl and Xhol sites of pIC20R, resulting in pIC20R///S3. A 782 bp EcoRl 
fragment from the LIG4 ORF was replaced with a 1367 bp EcoRI i/ZSJ-containing fragment 
from pIC20R/#S3 to construct pGEMTLIG4::HIS3. In order to obtain LIG4 disruptants, His + 
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colonies were selected after transformation of YPH250 with a 3854 bp NcoVNotl fragment from 
pGEMTLIG4::HIS3. In order to obtain RAD50 disruptants, YPH250 was transformed with an 
EcoRVBgM fragment from pNKY83, and Ura + colonies were selected (Alani et al., 1989). A 
rad50::hisG strain was obtained by selecting Ura" colonies on selective medium containing 
5-FOA. Similarly, RAD51 disruptants were obtained after transformation of YPH250 with a 
RAD51::LEU2 XbaVPstl fragment from pDG152 and selection of Leu + colonies (Schiestl et al., 
1994). The TRP1 marker in pSM21 (Schild et al., 1983) was replaced with a BglWXbal LEU2- 
containing fragment from pJJ283 (Jones and Prakash, 1990). This resulted in pSM21LEU2. 
Leu + RAD52 disruptant colonies were selected after transformation of YPH250 with the 
RAD52::LEU2 BarnHI fragment from pSM21 LEU2. Disruption constructs were transformed to 
YPH250 by the lithium acetate transformation method as described (Gietz et al., 1992; Schiestl 
et al., 1993). Disruption of YKU70, LIG4, RAD50, RAD51 and RAD52 was confirmed by PCR 
and Southern blot analysis. 
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[0027] Table 1: 


Yeast strains 




Strain 


Genotype 


Reference 


YPH250 


MATa, uraS-52, lys2-801, 
ade2-101, trpl-M, his3-A200, 
Ieu2-Al 


(Sikorski and Hieter, 1989) 


Y?m50rad51 


MATa, ura3-52, lys2-801, 
ade2-101, trpl-M, his3-A200, 
Ieu2-Al, rad51::LEU2 


This study 


Y?U250rad52 


MATa, ura3-52, lys 2-801, 
ade2-101, trpl-M, his3-A200, 
Ieu2-M, rad52::LEU2 


This study 


YFm50yku70 


MATa, ura3-52, lys2-801, 
ade2-101, trpl-M, his3-A200, 
Ieu2-Al, yku70::LEU2 


This study 


YPm50rad50 


MATa, ura3-52, lys 2-801, 
ade2-101, trpl-Al, his3-A200, 
Ieu2-Al, rad50::hisG 


This study 




jvifiiu, urtij-j^,, iy>j£-o\ji , 

ade2-101, trpl-Al, his3-A200, 
Ieu2-Al, Ug4::HIS3 


' 1 hie Qfiirlv 


JKM115 


Aho, Ahml::ADEl, MATa, 

\Iityiv ' A 1~}J? 1 ndpl 

LSI Iff II . . f\LsLLi 1 , UUZ, 1 , 

leu2-3,112, lys5, trpl::hisG, 
ura3-52 


(Moore and Haber, 1996) 


JKM129 


Aho, Ahml::ADEl, MATa, 
\hmr-ADFl ndpl 
leu2-3,112, lys5, trpl::hisG, 
ura3-52, xrs2::LEU2 


(Moore and Haber, 1996) 


JKM138 


Aho, Ahmlr.ADEl, MATa, 
Ahmr::ADEl, adel, 
leu2-3,112, lys5, trpl::hisG, 
ura3-52, mrell::hisG 


(Moore and Haber, 1996) 


YSL204 


Aho, HMLa, MATa, HMRa, 
adel -100, leu2-3,112, lys5, 
trpl::hisG, ura3-52, 
hisG'-URA3-hisG', sir4::HIS3 


(Leeetal., 1999) 
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Construction of binary vectors. 

[0028] To construct pSDM8000, a 1513 bp PvulVEcoRV fragment carrying the 
KanMX marker was obtained from pFA6a (Wach et aL, 1994) and was ligated into the unique 
Hpal site of pSDM14 (Offringa, 1992). pSDM8001 was made in three cloning steps. A 1476 bp 
BamHVEcoRI fragment carrying the KanMX marker was obtained from pFA6a and ligated into 
BamHI- and £coRI-digested pIC20H to form pIC20HfowMX. The KanMX marker was inserted 
between the PDA1 flanks by replacement of a 2610 bp Bgtll fragment from pUC4ElalO 
(Steensma et aL, 1990) with a 1465 Bglll fragment from pIC20HfowMX. A 3721 bp XhoVKpnl 
fragment from this construct was inserted into the Xhol and Kpnl sites of pSDM14. The binary 
vectors pSDM8000 and pSDM8001 were introduced into Agrobacterium tumefaciens LBA1119 
by electroporation (den Dulk-Ras and Hooykaas, 1995). 

Cocultivations / T-DNA transfer experiments. 

[0029] Cocultivations were performed as described earlier with slight modifications 
(Bundock et aL, 1995). Agrobacterium was grown overnight in LC medium. The mix of 
Agrobacterium and 5. cerevisiae cells was incubated for nine days at 20°C. G418-resistant S. 
cerevisiae strains were selected at 30°C on YPAD medium containing geneticin (200 jig/ml) 
(Life Technologies/Gibco BRL). 

Vectorette PCR. 

[0030] Chromosomal DNA was isolated using Qiagen's Genomic Tips G/20 per 
manufacturer's protocol. 1-2 [ig of Genomic DNA was digested with EcoRI, CM, Pstl or 
Hindll! and run on a 1% TBE-gel. Nonradioactive Southern blotting was performed. The 
membrane was hybridized with a digoxigenine-labeled kanMX probe to determine the size of 
T-DNA/genomic DNA fragments (EcoRl and Clal for RB-containing fragments and Pstl and 
Hindlll for LB-containing fragments). The kanMX probe, a 792 bp internal fragment of the 
KanMX marker, was made by PCR using primers kanmxpl 5 '- AGACTCACGTTTCGAGGCC- 
3' and fowmxp2 5'-TCACCGAGGCAGTTCCATAG-3 ' and a Nonradioactive DNA Labeling 
and Detection kit (Boehringer Mannheim). The enzyme showing the smallest band on blot was 
used for Vectorette PCR in order to amplify the smallest junction sequence of T-DNA and 
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genomic DNA. Vectorette PCR was performed as described 

(http://genomewww.stanford.edvl/group^otlab/protocols/vectorette.html)^ The Expand™ High 
Fidelity System (Boehringer Mannheim) was used to amplify fragments larger than 2.5 kb, 
whereas sTaq DNA polymerase (SphaeroQ) was used for amplification of fragments smaller 
than 2.5 kb. Primers fowmxp3 5 '-TCGC AGGTCTGC AGCG AGG AGC-3 ' and kanmxp4 
5'-TCGCCTCGACATCATCTGCCCAG-3' were used to amplify RB/genomic DNA and 
LB/genomic DNA junction sequences, respectively. 

T7 DNA Polymerase sequencing. 

[0031] Vectorette PCR products were cloned in pGEMTEasy (Promega) and sequenced 
using the T7 polymerase sequencing kit (Pharmacia) according to the manufacturer's protocol. 
In order to obtain sequences flanking the RB and LB, primers kanmxp5 
5'-TCACATCATGCCCCTGAGCTGC-3' and kanmxpA were used, respectively. 

RESULTS 

1 . Binary vectors for T-DN A transfer to yeast. 

[0032] It was previously demonstrated that Agrobacterium tumefaciens is able to 
transfer its T-DNA not only to plants but also to another eukaryote, namely, the yeast 
Saccharomyces cerevisiae (Bundock et al., 1995). T-DNA carrying homology with the yeast 
genome was shown to become integrated by homologous recombination. T-DNA lacking any 
homology with the yeast genome was integrated randomly into the genome by IR, like in plants 
(Bundock et al., 1995; Bundock and Hooykaas, 1996). The T-DNA used in these experiments 
carried the S. cerevisiae URA3 gene for selection of Ura + colonies after T-DNA transfer to the 
haploid yeast strain RSY12(URA3A). However, in this system, only yeast strains could be used 
in which the URA3 gene had been deleted to avoid homology between the incoming T-DNA and 
the S. cerevisiae genome. 

[0033] We wanted to set up a system in which T-DNA transfer to any yeast strain could 
be studied. Therefore, two new binary vectors were constructed using the dominant marker 
kanMX (Wach et al., 1994), which confers resistance against geneticin (G418). The T-DNA of 
pSDM8000 carries only the KanMX marker. Since this KanMX marker consists of heterologous 
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DNA, lacking any homology with the 5. cerevisiae genome, we would expect this T-DNA to 
integrate by ER at a random position in the yeast genome. To be able to compare this with 
T-DNA integration by homologous recombination, pSDM8001 was constructed. The T-DNA of 
pSDM8001 carries the KanMX marker flanked by sequences from the S. cerevisiae PDA1 locus. 
The PDA1 sequences have been shown to mediate the integration of T-DNA by HR at the PDA1 
locus on chromosome V (Bundock et al., 1995). 

[0034] Cocultivations between Agrobacterium strains carrying pSDM8000 and 
pSDM8001, respectively, and the haploid yeast strains YPH250 and JKM1 15, respectively, were 
carried out as described in the experimental part. G418-resistant colonies were obtained at low 
frequencies for YPH250 (1.6 x 10* 7 ) and JKM115 (1.2 x 10" 5 ) after T-DNA transfer from 
pSDM8000 (Table 2). T-DNA transfer from pSDM8001 -generated G418-resistant colonies at 
higher frequencies (2.4 x 10' 5 for YPH250 and 1.8 x 10" 4 JKM115, Table 2). The ratio of 
homologous recombination versus illegitimate recombination is determined by comparing the 
frequencies of G418-resistant colonies obtained from cocultivations using either pSDM8001 or 
pSDM8000. This showed that a T-DNA from pSDM8001 was 150-fold more likely to integrate 
than a T-DNA from pSDM8000 in YPH250 (Table 2). A similar difference was previously seen 
using T-DNAs with the URA3 marker (Bundock and Hooykaas, 1996). In contrast, T-DNA from 
pSDM8001 was only 16-fold more likely to integrate than a T-DNA from pSDM8000 in 
JKM1 15. There was no significant difference in the frequency of T-DNA transfer to these two 
yeast strains as was determined by T-DNA transfer experiments in which a T-DNA that carried 
the KanMX marker and the yeast 2 micron replicon was employed. Therefore, the differences in 
the frequencies of T-DNA integration by HR and IR between the yeast strains YPH250 and 
JKM1 15, respectively, most likely contributed to differences in the capacities of their HR and IR 
recombination machineries. 

[0035] We confirmed by PCR that the T-DNA from pSDM8001 became integrated at 
the PDA1 locus by homologous recombination (data not shown). In order to find out whether 
the T-DNA from pSDM8000 had integrated randomly by IR, yeast target sites for integration 
were determined from eight G418-resistant YPH250 colonies by Vectorette PCR (for detailed 
description see materials and methods). Chromosomal DNA was isolated and digested with a 
restriction enzyme that cuts within the T-DNA. A Vectorette was ligated to the digested DNA 
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and a PCR was performed using a T-DNA-specific primer and a Vectorette-specific primer. The 
PCR product obtained was cloned into pGEMTEasy and sequenced using a T-DNA-specific 
primer. The position of the T-DNA insertion was determined by basic BLAST search of the 
yeast genome (http://www-genome.stanford.edu/SGD). We were thus able to map the position 
of the T-DNA insertions of all eight G418-resistant colonies analyzed. They were present at 
different positions spread out over the genome. Comparison of the T-DNA sequence and yeast 
target site sequences did not reveal any obvious homology. These data show that the T-DNA 
from pSDM8000 had integrated via an IR mechanism as expected. 

[0036] The following characteristics have previously been observed for T-DNAs 
integrated by IR: a) the 3' end of the T-DNA is usually less conserved compared to the 5' end, 

b) microhomology is sometimes present between the T-DNA ends and the target site, 

c) integration is often accompanied by small deletions of the target site DNA (Matsumoto et al., 
1990; Gheysen et al, 1991; Mayerhofer et al., 1991; Bundock and Hooykaas, 1996). Similar 
characteristics were seen in the currently analyzed eight T-DNA insertions. In three strains, we 
observed microhomology of 2 - 6 bp between the LB and yeast target site (FIG. 1, WT.51 was 
taken as an example). In five strains, deletions of 1 - 5 bp of yeast target site DNA was found 
and we observed deletions, varying from 1 - 1 12 bp, of the 3' end of the T-DNA in seven out of 
eight analyzed strains. In only one strain, the 3' end appeared to be intact. The 5' end of the 
T-DNA was conserved in almost all strains. In only two strains, we observed small deletions of 
1 and 2 bp at the 5' end of the T-DNA. 

[0037] Thus, we can conclude that the T-DNA from pSDM8000 had integrated via the 
same IR mechanism described before. 
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[0038] Table 2: Frequencies of T-DNA integration by IR relative to integration by HR 
in recombination defective yeast strains 



Strain 


Genotype 


Freq. of Ir a 


Freq. of HR 


Absolute 

tt> /iur> ^.^i.: 

1K/HK ratio 


Standardized 
1K/HK ratio 


YPH250 


WT 


i a « 1 a~7 
1.6 x 1U 


z.4 X 1U 


a nm 


i 
1 


vnuo ca 


racljlA 


1.4 X 1U 


IjX 1U 


n no 


1 zi 

14 


lei 

rad51 












YPH250 


rad52A 


3.8 x 10 


O C .... 1 rv-0 

2.5 x 10 


A 1 C 

0.15 


23 


rad52 












YPH250 


yku70A 


<3.2 x 10" 9 


3.3 x 10" 5 


<0.0001 


<0.01 


yku70 












YPH250 


radSOA 


8.0 x 10" 9 


3.5 x 10" 5 


0.0002 


0.03 


rad50 












YPH250 


Ug4A 


3.7 x 10" 9 


2.3 x 10" 


0.0002 


0.02 














JKM115 


WT 


l.2x lO" 5 


1.8 x 10" 4 


0.07 


1 


JKM129 


xrs2A 


2.7 x 10" 7 


5.1 x 10" 5 


0.005 


0.08 


JKM138 


mrellA 


2.9 x lo -7 


7.5 x lO -5 


0.004 


0.06 


YSL204 


sir4A 


l.5x 10" 7 


1.8 x 10" 5 


0.008 


0.13 



Averages of two or more independent experiments are shown. Frequencies are depicted as 
the number of G418-resistant colonies divided by the output number of yeast cells 
(cells/ml). 

The frequency of T-DNA integration by IR (pSDM8000) divided by the frequency of 
T-DNA integration by HR (pSDM8001). 

The ratio of IR/HR in the mutant strain divided by the ratio of IR/HR in the wild-type 
strain. 
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2. Host-specific factors involved in random T-DNA integration. 

[0039] The observation that the T-DNA from pSDM8000 integrates by IR into the 
yeast genome allowed us to use this system to study the effect of host factors on the process of 
integration. Many proteins involved in various forms of DNA recombination have been 
identified in yeast. In order to determine the roles of a representative set of these enzymes in 
T-DNA integration, we compared T-DNA transfer and integration in wild-type yeasts with that 
of strains carrying disruptions of the genes encoding several recombination proteins. The 
RAD51, RAD52, KU70, RAD50 and LIG4 genes were deleted from YPH250 using the one step 
disruption method. Yeast strains carrying deletions in MRE11, XRS2 and SIR4 in the JKM1 15 
background were kindly provided by Dr. J. Haber. The results of cocultivations with these yeast 
strains are shown in Table 2. 

[0040] In rad51 and rad52 mutants, which are impaired in homologous recombination, 
the frequency of T-DNA integration by HR was sixteen- and nine- fold lower, respectively, than 
observed for the wild-type (Table 2). This showed that RAD51 and RAD52 play a role in 
T-DNA integration by homologous recombination. In the IR defective ku70, rad50, Ug4, mrell, 
xrs2 and sir4 mutants, the frequency of T-DNA integration by HR did not differ significantly 
from that observed for wild-type (Table 2). This showed that these genes do not play a role in 
T-DNA integration by homologous recombination. 

[0041] The frequency of T-DNA integration by IR in a radSl mutant did not differ 
significantly from that observed for wild-type, whereas in a rad52 mutant, the frequency was 
about two-fold higher (Table 2). This showed that RAD51 and RAD52 are not involved in 
T-DNA integration by IR. The product of the RAD52 gene may compete with IR-enzymes for 
the T-DNA and thereby inhibits integration by IR to some extent. Strikingly, in rad50, mrell, 
xrs2, Ug4 and sir4 mutants, the frequency of T-DNA integration by IR was reduced dramatically: 
20- to more than 40-fold (Table 2). T-DNA integration by IR seemed to be completely abolished 
in the ku 70 mutant. We did not obtain any G418-resistant colonies from several cocultivation 
experiments. This strongly suggests that KU70 plays an important role in random T-DNA 
integration in yeast. 
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[0042] Since T-DNA integration by HR is normal in these mutants, these results clearly 
show that the yeast genes KU70, RADSO, MRE11, XRS2, LIG4 and SIR4 are involved in T-DNA 
integration by illegitimate recombination. 

3. Chromosomal distribution of integrated T-DNA copies in IR-defective S. cerevisiae. 

[0043] From several cocultivation experiments with the radSO, mrell, xrs2, Ug4 and 
sir 4 mutants, we obtained a small number of G418-resistant colonies. The T-DNA structure was 
determined for a number of these lines. To this end, chromosomal DNA was isolated from these 
G418-resistant colonies and subjected to vectorette PCR to amplify junction sequences of 
genomic DNA and T-DNA. PCR products were cloned and sequenced. The yeast sequences 
linked to the T-DNA were used in a BLAST search at http://www-genome.stanford.edu/SGD to 
determine the T-DNA integration sites. 

[0044] Strikingly, analysis of LB/genomic DNA junctions revealed that in two out of 
three radSO, four out of six mrell and two xrs2 strains analyzed, T-DNAs had integrated in 
telomeres or subtelomeric regions (radSOkl, rad50k6, mrellk8, mrellk.ll, mrellk.14, 
mrellk.17, xrs2kl and xrs2kl7\ Table 3 and FIG 1). S. cerevisiae telomeres generally consist 
of one or more copies of the Y' element followed by telomerase-generated C(l-3)A/TG(l-3) 
repeats (Zakian, 1996). In two radSO strains, two mrell strains and one xrs2 strain, the LB was 
found to be fused to this typical telomerase-generated C(l-3)A/TG(l-3) repeat (radSOk.l, 
radSOk.6, mrellk.14, mrellk.17 and xrs2kl\ FIG. 1). Besides, we also found one T-DNA 
insertion in a Ty LTR element in the mrell mutant and two insertions in the rDNA region, 
present in chromosome XII, in the mrell and radSO mutants (mrell k.5, mrellk.4 and rad50k5, 
respectively; Table 3 and FIG. 1). 

[0045] The 3' end of the T-DNA was truncated in all strains. Deletions of 3-1 1 bp of 
the 3' end of the T-DNA were observed (FIG. 1). Microhomology between the 3' end of the 
T-DNA and yeast target site was only found in two lines (5 bp in mrellkA and 4 bp in 
mrellkl4\ FIG. 1). For the T-DNA copies present at the yeast telomeres, the RB/genomic DNA 
junction sequences could not be obtained from these strains using vectorette PCR. This was only 
possible for the rad50 and mrell strains carrying the T-DNA in the rDNA region on 
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chromosome XIL In both strains, the RB was intact and no homology between the 5' end of the 
T-DNA and the yeast target site was found (data not shown in FIG. 1). 

[0046] Previously, target sites for T-DNA integration in the genome of S. cerevisiae 
strain RSY12 were determined (Bundock and Hooykaas, 1996; Bundock, 1999). In four out of 
44 strains analyzed, T-DNA copies were integrated in rDNA, Ty LTR elements (in two strains) 
and a subtelomerically located Y' element, respectively. In addition, we determined the position 
of T-DNA integration in ten S. cerevisiae YPH250 strains. We did not find any T-DNA 
insertions in rDNA, LTR elements or subtelomeric/telomeric regions amongst these ten. Pooling 
all insertions analyzed in wild-type (54), in two out of 54 strains analyzed (4%), insertions were 
found in a Ty LTR element and in two other strains in the rDNA repeat (2%) and a subtelomeric 
region (2%), respectively. In contrast, we report here that T-DNA in yeast strains mutated in 
RAD50, MRE11 or XRS2 T-DNA integrates preferentially in (sub)telomeric regions (eight out of 
eleven lines: -73%) of rad50, mrell and xrs2 mutants (Table 3). From the remaining strains, 
two T-DNAs were present in rDNA and one in a Ty LTR element, respectively. Apparently, the 
rDNA repeat is also a preferred integration site in these mutants (-18% vs. -2% in the wild- 
type). 

[0047] Telomeres consist of a large array of telomerase- generated C(l-3)A/TG(l-3) 
repeats (-350 bp). In the subtelomeric regions, two common classes of Y' elements, 6.7 and 5.2 
kb, can be found (in most strains, chromosome I does not contain Y') (Zakian and Blanton, 
1988), making the average size of these regions -6.0 kb. Thus, the yeast genome contains (16 x 
2 x 0.35) + (15 x 2 x 6.0) = 191 kb of subtelomeric/telomeric sequences. The yeast genome is 
12,052 kb in size, which means that only 1.6% of the genome consists of subtelomeric/telomeric 
sequences. In accordance with this, we observed in only 2% of the wild-type strains T-DNA 
copies inserted in a subtelomeric region, which we would expect on the basis of random T-DNA 
integration. In contrast, in the radSO, mrell and xrs2 mutants, 73% of the T-DNA insertions 
were found in the (sub)telomeric region. 

[0048] Analysis of seven lines revealed that in the sir4 mutant T-DNA was integrated 
randomly into the yeast genome. So, although SIR4 has an effect on the efficiency of T-DNA 
integration by IR, the pattern of T-DNA distribution in the transformants seems similar as in the 
wild-type strain. In the sir4 mutant T-DNA, integration by IR was characterized by truncation of 
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the 3' end of the T-DNA, deletions at the target site and microhomology between the LB and the 
target site (data not shown); this was observed for T-DNA integration by IR in the wild-type. 

[0049] These results clearly show that in the rad50, mrell and xrs2 mutants, the 
T-DNA, if integrated at all, becomes preferentially inserted in telomeres or subtelomeric regions 
and that the genomic distribution of integrated T-DNAs is altered when compared to wild-type. 
However, disruption of SIR4 did affect the efficiency of T-DNA integration by IR but not the 
characteristics of this process. 

[0050] Table 3: genomic distribution of T-DNA integrated by IR in radSO, mrell and 
xrs2 mutants in comparison with the wild-type after T-DNA transfer from pSDM8000 



Yeast strain (Sub)Telomeric LTR rDNA Elsewhere 

region 

radSO mutant 2 0 1 0 

mrell mutant 4 110 

xrs2 mutant 2 0 0 0 

wild-type 1 2 1 50 
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